Data Structure Description

نویسنده

  • Graham Cormode
چکیده

Streaming algorithms aim to summarize a large volume of data into a compact summary, by maintaining a data structure that can be incrementally modified as updates are observed. They allow the approximation of particular quantities. The AMS Sketch is focused on approximating the sum of squared entries of a vector defined by a stream of updates. This quantity is naturally related to the Euclidean norm of the vector, and so has many applications in high-dimensional geometry, and in data mining and machine learning settings that use vector representations of data. The data structure maintains a linear projection of the stream (modeled as a vector) with a number of randomly chosen vectors. These random vectors are defined implicitly by simple hash functions, and so do not have to be stored explicitly. Varying the size of the sketch changes the accuracy guarantees on the resulting estimation. The fact that the summary is a linear projection means that it can be updated flexibly, and sketches can be combined by addition or subtraction, yielding sketches corresponding to the addition and subtraction of the underlying vectors.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Wage Inequality in Developing Countries: An Examination of New Economic Geography Theory

The purpose of this study is to investigate the factors affecting the manufacturing industry wage among selected developing countries based on new economic geography theory. More specifically, we use a panel data model to study the spatial structure of wages in 136 countries for the period 1998-2007. The results indicate that this theory provides a good description of the spatial structure of w...

متن کامل

Description of the ovarian follicle maturation of the migratory adult female bulatmai barbel (Luciobarbus capito,Güldenstädt 1772) in captivity

The study aimed to investigate the maturation process of ovarian follicles and ovary structure of migratory form of female Bulatmai barbel (Lucioarbus capito). The histology of oogenesis coincided with that known from most teleosts. The ovarian structure was found to be cytovarian. The development of the oocytes is started from early May along with spawning and the degeneration of matured oocyt...

متن کامل

Description of the ovarian follicle maturation of the migratory adult female bulatmai barbel (Luciobarbus capito,Güldenstädt 1772) in captivity

The study aimed to investigate the maturation process of ovarian follicles and ovary structure of migratory form of female Bulatmai barbel (Lucioarbus capito). The histology of oogenesis coincided with that known from most teleosts. The ovarian structure was found to be cytovarian. The development of the oocytes is started from early May along with spawning and the degeneration of matured oocyt...

متن کامل

An efficient CAD tool for High-Level Synthesis of VLSI digital transformers

Digital transformers are considered as one of the digital circuits being widely used in signal and data processing systems, audio and video processing, medical signal processing as well as telecommunication systems. Transforms such as Discrete Cosine Transform (DCT), Discrete Wavelet Transform (DWT) and Fast Fourier Transform (FFT) are among the ones being commonly used in this area. As an illu...

متن کامل

محاسبه ثابت جفت‌شدگی ساختار ریز و تانسور g رادیکال‌های آلانین در دماهای مختلف بلور بر پایه نظریه تابعی چگالی

In this paper, Density Functional Theory (DFT) was utilized for the calculation of the hyperfine coupling constant and the g tensor alanine radicals at different crystal temperatures. The cluster approach was used for considering the effects of crystal environment. In the cluster approach, the careful selection of the cluster size is very important for the geometry structure of alanine and the ...

متن کامل

A Method to Reduce Effects of Packet Loss in Video Streaming Using Multiple Description Coding

Multiple description (MD) coding has evolved as a promising technique for promoting error resiliency of multimedia system in real-time application programs over error-prone communicational channels. Although multiple description lattice vector quantization (MDCLVQ) is an efficient method for transmitting reliable data in the context of potential error channels, this method doesn’t consider disc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014